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Abstract. We propose a novel SIR epidemic model which is driven by the 
transmission of infection packets in networks. Specifically, infected nodes generate 
and deliver infection packets causing the spread of the epidemic, while recovered nodes 
block the delivery of infection packets, and this inhibits the epidemic spreading. The 
efficient routing protocol governed by a control parameter a is used in the packet 
transmission. We obtain the maximum instantaneous population of infected nodes, 
the maximum population of ever infected nodes, as well as the corresponding optimal 
a through simulation. We find that generally more balanced load distribution leads to 
more intense and wide spread of an epidemic in networks. Increasing either average 
node degree or homogeneity of degree distribution will facilitate epidemic spreading. 
When packet generation rate p is small, increasing p favors epidemic spreading. 
However, when p is large enough, traffic congestion appears which inhibits epidemic 
spreading. 


TAGS numbers: 89.75.Hc, 89.75.Fb, 05.10.-a 


2 


1. Introduction 

As the increase of connectivity in and between different complex systems, spread of 
many diseases or virnses is becoming more and more prevalent in onr society[Tl |2l[3lll]. 
For instance, ontbreaks of many infectious diseases, including Severe Acute Respiratory 
Syndromes (SARS), Swine flu (HlNl), and the recent Ebola virus, caused great damage 
and loss of life. The spread of computer and mobile phone viruses brought about 
a great deal trouble to human life and serious damage to economy. Understanding 
the intrinsic mechanisms of those spreading processes and designing efficient control 
strategies become very important and urgent tasks, which bring together a lot of 
researchers from areas of biology, sociology, mathematics, physics, engineering, etc [3]. 

Mathematical modeling of epidemic spreading has a long history of more than two 
hundred yearsjl]. Generally, the population is divided into several classes: susceptible, 
infected and recovered individuals. Susceptible individuals represent those who can 
contract the infection. Infected individuals were previously susceptible individuals and 
got infected by the disease. Recovered individuals are those who have recovered from 
the infection. In the susceptible-infected-susceptible (SIS) modeI|4], infected individuals 
can recover from the disease and become susceptible individuals again. While in the 
susceptible-infected-recovered (SIR) modeI[l], infected individuals no longer get infected 
after recovery from the disease, which are assumed to get the permanent immunity. In 
classical epidemiology, a common assumption is that individuals in a class is treated 
similarly, and have equal probability to contact with everyone else[3l ll]. However, 
the recent abundance of data demonstrates that both the connectivity pattern and 
the contact rate are heterogeneous among real-world complex networks[3l [5], which 
means the traditional deterministic differential equations and many other related results 
of epidemic processes are inadequate in real-world situations. This great stimulates 
the research of epidemics on real-world complex networks[l]. Due to the complexity 
of real-world networks, the mean-held approach [6l 0 [8] and the generating function 
approach j9] are used to drive the analytic results of epidemics spreading. One of the 
remarkable results obtained by Pastor-Satorras and VespignanijH |6] shows that in the 
limit of a network of inhnite size, the epidemic threshold of the SIS model tends to zero 
asymptotically in scale-free networks with power-law parameter in (2, 3]. For SIR model, 
it was found that in the thermodynamic limit, not only the threshold tends to vanish, but 
also the time for the stabilization of the infection becomes very small [TIH ITT] . By using 
the message-passing approach, Karrer and Newman [12] calculated the probabilities for 
any node and any time to be in state S, I, and R on tree structure. Many other 
explicit results of SIR model are obtained by mapping the SIR model to the percolation 
pro cess P IT3l IT^. Also, effects of degree correlations [T^ 116] . clustering [TTl ITS] , weights 
and directions of edges [7] HH] 120] on epidemic spreading are broadly discussed. On the 
other hand, various efficient immunization protocols [U [211122] have been designed for 
controlling the spread of epidemics on networks. Recently much attention has been 
transferred to epidemic spreading in temporal and multiplex networks [23 [ |2Tl |25] . 
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Addition to diseases or viruses, there are usually many other substances spreading 
in networks like information packets, goods, ideas, etc., which depend on the specihc 
types of the networks. Epidemic spreading is often coupled with the delivery of 
these substances. For example, HIV spreads through the exchange of body fluids 
among individuals in contact networks. Computer viruses spread with the delivery of 
information packets in computer networks. Flu often spreads by air traffics among 
different spatial areas. Therefore, understanding the mechanisms of these coupled 
spreading processes and how these processes affect each other is signihcant for designing 
efficient epidemic immunization strategies. Meloni et aiizg hrst studied the effects 
of traffic ffow on epidemic spreading. They found that the epidemic threshold in 
the SIS model decreases as flow increases, and emergence of traffic congestion slows 
down the spread of epidemics. Then, Yang et al[27l EH] further studied the relation 
between traffic dynamics and the SIS epidemic model, and found that the epidemic can 
be controlled by hne tuning the local or global routing schemes. Furthermore, they 
obtained that the epidemic threshold can be enhanced by cutting some specihc edges in 
the network |2^. The impacts of traffic dynamics on SIR epidemic model haven’t been 
reported in literature. In this paper, we study the traffic-driven SIR spreading dynamics 
in complex networks. We focus on the instantaneous size of infected population, and the 
hnal size of ever infected population. Based on these two properties, we study how the 
packets transmission process governed by given routing protocols affects the epidemic 
spreading. 

2. Model 

Our model includes two coupled processes: packet delivery process and the epidemic 
spreading process. We will introduce our model in the context of computer networks. 

2.1. packet delivery process 

We assume that nodes in the network are identical which can generate, receive and 
deliver information packets. Each node has a queue obeying the First-In-First-Out 
(FIFO) rule for storing packets. The length of the queue is set inhnite. Load of a node 
is the number of packets in its queue. Every node generates packets at a rate p. For 
example if p = 1.5, each time an arbitrary node generates one packet dehnitely and 
another one with probability 0.5. The destination nodes of the packets are chosen 
randomly, and the packets will be removed from the network after arriving at the 
destination nodes. The transmission of packets is governed by the efficient routing 
protocol proposed by Yan et aim. For an arbitrary path p of length I between node s 
and d, denoted as (s, ui, 77 - 2 , • • •, n-z-i, d), its routing cost is dehned as follows: 

0 (p) =Y,kf. 

2 = 1 


( 1 ) 
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Where ki is the degree of node f, and a is a control parameter. The sum runs over all 
the intermediate nodes of path p. The efficient paths for delivering packets are dehned 
to be those which have the minimum routing costs. If there are many efficient paths 
between two nodes, we randomly chose one for delivering packets. According to Eq. 1, 
a determines the routing cost of a path. When a > 0, paths with large-degree nodes 
usually have large routing costs. Thus, efficient paths tend to be those paths composed 
of small-degree nodes when a > 0, and vice versa. Each time a node can deliver at 
most C packets. When C = oo, all the packets can be delivered without delay, there is 
no traffic congestion. The overall load of the network is constant after a short transient 
time. When C* is a constant value, there is a critical packet generation rate pc- When 
p < pci there is no traffic congestion. When p > pc, the network generates more packets 
than it can deliver. As a result, the overall load of the network increases with time, 
which is the traffic congestion phenomenon. We use the order parameter to characterize 
the traffic congestion, which is as follows [^: 

^ i i.m hAtMuAA 

pN At 

Where w{t) is the total number of packets in the network at time t. When -0 = 0, the 
generated and delivered packets are balanced, and the network is under the free flow 
state. When -i/; > 0, packets accumulate continuously in the network, which indicates 
that there exists traffic congestion. 

2.2. Epidemic spreading process 

As in the traditional SIR model, nodes in the network are divided into three classes: 
susceptible nodes, infected nodes and recovered nodes. Initially, all the nodes in 
the network are susceptible nodes, which perform the normal function of generating, 
delivering, and receiving packets, and the network flow is stable. Then we randomly 
select a node to be an infected one which is the original source of the infection. Infected 
nodes generate infection packets instead of normal packets at each time step. With 
the delivery of these infection packets, more and more susceptible nodes get infected 
after receiving the infection packets. Infected nodes get recovered and become recovered 
nodes with probability p at each time step. Recovered nodes generate normal packets, 
and they can also make the infection packets into normal packets. Thus, recovered 
nodes block the epidemic spreading. The epidemic spreading ends when all the infected 
nodes become recovered nodes. The transitions between the susceptible, infected, and 
recovered nodes for our model are shown in figure 1. 

3. Results 

The underlying networks are random networks generated by the Erdos-Renyi (ER) 
model [32] and scale-free networks generated by the static model [33]. Also, some real- 
world networks [341 13^ 136] are used in the simulations. 
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Figure 1. Transitions of susceptible nodes, infected nodes and removed nodes. 
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Figure 2. Time evolution of our model on the networks 


3.1. Evolution of our model 

We assume that, at time t, the numbers of susceptible nodes, infected nodes and 
recovered nodes are S{t), I(t) and R{t) respectively. First, we study the time evolution 
of our model on the ER model, the static model, and the Email network|3l]. We add 
the infection to the network at t = 150 when the network is under free flow state, 
by randomly selecting a node to become infected. In hgure 2 (a), (b) and (c), we see 
that S(t) decreases with t greatly, and then tends to be stable. On the contrary, R{t) 
increases with t abruptly, and then converges at Re, which is the maximum number 
of recovered (or ever infected) node. Re reflects the range of the infection. Differently 
from S{f) and R{t), I{t) increases with t hrst, then decreases with t. The peak of I{t), 
denoted by Ip, represents the maximum instantaneous number of infected nodes during 
the epidemic spreading process. Ip reflects how intense the epidemic spreading is. At 
any time t, the sum of S{t), I{t), and R{t) equals the size of the network. 

3.2. Impacts of a 

Routing parameter a is one of the important factors in our model, a determines the 
efficient paths for the delivery of normal and infection packets. In hgure 2(d), (e), and 
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Figure 3. Ip, and cti, vs. a. The network models, the Email network, as well as 
all the parameters are the same as in figure 2. The results are the average over 100 
independent runs. 


(f), we see that S(t), I(t) and R{t) all vary with a, and for any of the three, the trends 
of the curves of different a are similar. We focus on the maximum instantaneous size 
of infected population Ip, and the hnal size of ever infected population Rg. In hgure 3 
(a), (b) and (c), we see that both Ip and Rg increase with a hrst, then decrease with a. 
There are optimal a which correspond to the maximum Ip and Rg respectively. Note 
that the optimal a for Ip and Rg are close, but not necessarily the same. There are 
jumps in both Ip and Rg when a is near zero. To explain these results, we calculate the 
load variance ai of nodes, which is dehned as follows: 

Li = ^T,J=oLi{t), 

(3) 


O-L = 




N 


Where Li{t) is the load of node i at time t. T is a constant value and is large enough 
to ensure accurate calculation of the average load R of node i. When ai is small, the 
load distribution is relatively even in the network, and vice versa. In hgure 3 (d), (e) 
and (f), ai decreases hrst and then increases with a. There is optimal a which leads to 
the minimum cr^. Interestingly, the values of the optimal a for Ip and Rg are close, 
which indicates that homogeneous load distribution facilitates the epidemic spreading. 
There is also abrupt decrease in when a is near zero. This is because the efficient 
paths are very diherent for a < 0 and a > 0, and the load is abruptly redistributed 
from large-degree nodes to small-degree nodes when a increases from below zero to 
above zero. This also accounts for the jumps in Ip and Rg. The results are consistent 
for both the models networks and the Email network as shown in hgure 3. 
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Figure 4. Ip and Re vs. a for different p. The results are the average over 100 
independent runs. 


3.3. Impacts of p 

Packet generation rate p also has great impacts on the epidemic spreading. In hgure 4, 
we see that the peak of Re for p = 0.5 is almost 4 times of the peak of Re for p = 0.2. 
The peak of Ip for p = 0.5 is almost 6 times of the peak of Ip for p = 0 . 2 . Also the 
positions of the peaks for p = 0.1, 0.2 and 0.5 are different. In hgure 5 (a), we clearly 
see both the maximum Ip and the maximum Re increase with p. When p becomes 
large, infected nodes will generate more infected packets, which facilitates the epidemic 
spreading. In hgure 5 (b), we see that aopt decreases with p, which indicates an increase 
of dependency on large-degree nodes. To illustrate this, we focus on the top 1% largest 
degree nodes, and calculate the average ratio p of the number of infection packets a large 
degree node delivered when it was infected to the number of infection packets it cured 
after it recovered. In hgure 6 , rj increases with p, which means that the role large-degree 
nodes play in facilitating the epidemic spreading becomes more and more remarkable 
compared to the role they play in inhibiting the epidemic spreading. This explains why 
the epidemic spreading becomes more dependent on large-degree nodes to spread widely 
when p increases as shown in hgure 5 (b). 

3.f. Impacts of network structure 

First, we show the impacts of network density on the epidemic spreading. In hgure 7 
(a) and (c), we see that the maximum Ip and the maximum Re increase with average 
degree {k) for both random networks and scale-free networks. This indicates that more 
edges facilitate the epidemic spreading. In hgure 7 (b) and (d), aopt decreases with 
(/c), but is large than zero. This means that, to realize an intense and wide epidemic 
spreading, generally the paths for the delivery of infection packets should be biased 
towards including small-degree nodes, but when the network becomes dense, the degree 
of the dependence of large-degree nodes in the transmission of infection packets should 
be increased accordingly. Then we show the impacts of power-law parameter 7 on the 
epidemic spreading in hgure 8 . Both the maximum Ip and the maximum Re increase 
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Figure 5. The maximum /p, the maximum Re and the corresponding aopt vs. p. The 
network model and the parameters are the same as in figure 4. The results are the 
average over 100 independent runs. 



P 

Figure 6. The ratio rj vs. p. The results are the average over 10 independent runs. 


with 7 first, and then tend to be stable. The aopt increases slightly with 7 . We infer 
from fignre 8 that homogeneons network strnctnre facilitates the epidemic spreading. 

3.5. Impacts of traffic congestion 

When delivery capacity (7 is a constant valne, and the packet generation rate p 
is large enongh, the packets can’t be delivered in time, and then the nnmber of 
packets accumulated in the network increases with time, which is the traffic congestion 
phenomenon. In the simulation, C is set to 10, f is calculated according to Eq. 2 to 
quantify the degree of traffic congestion. In hgure 9 (a), we see that only when a is 
in [ 0 . 6 , 1 ], there is no traffic congestion in the network, which is indicated hj f = 0. 
Otherwise, there exists traffic congestion where f > 0. In hgure 9 (b), we see that both 
Ip and Re increase with a hrst, and then decrease with a. The optimal a for Ip and Re 
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Figure 7. The maximum /p, the maximum Rg and the corresponding aopt vs. {k). 
The results are the average over 100 independent runs. 



Figure 8. The maximum Ip, the maximum Rg and aopt vs. 7 . The results are the 
average over 100 independent runs. 


are 0.7 and 0.9 respectively, where there is no traffic congestion. Also, we obtain that 
when the traffic congestion is not serions like a in [1.5, 2] (figure 9 (a)), the epidemic 
spreading can still spread intensely and widely, which is inferred by Ip and Re in figure 
9 (b). Then we fix a to be zero, and study how the maximum Ip and the maximum Rg 
vary with p. In figure 10, we obtain that both the maximum Ip and the maximum Rg 
increase first and then decrease with p, which is consistent for both random networks 
and scale-free networks. The reason for these results is that when p is small, there is 
no traffic congestion, and the infection becomes intense and wide spread with increase 
of p. However, when p is large enough, traffic congestion appears, which inhibits the 
epidemic spreading. 
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Figure 9. ip, Ip and Re vs. a. The results are the average over 100 independent runs. 



p p 


Figure 10. The maximum Ip and the maximum Rg vs. p. The results are the average 
over 100 independent runs. 


3.6. Impacts of routing protocols 

In addition to the efficient routing protocol, we also study the impacts of the other 
static routing protocols on the epidemic spreading. If k is replaced with Ig k in Eq. 
1, then we get the cost function of the optimal routing protocol [37]. For the optimal 
routing protocol, we only present the results of Ip and Re vs. a on three real-world 
networks |341135] 13^ in hgure 11. There are also optimal a which correspond to the 
maximum Ip and the maximum Re respectively. When a is near zero, there are also 
jumps in Ip and Re due to the signihcant change of paths for delivering packets. For 
the optimal routing and the efficient routing, either the maximum Ip or the maximum 
Re is very close. The difference lies in that Ip and Re of the optimal routing vary much 
slower than that of the efficient routing. 

4. Conclusion 

In summary, we propose a traffic-driven SIR epidemic model and study the impacts of 
several factors on our model. We hnd that the epidemic spreading is greatly affected 
by the load distribution, and homogeneous load distribution facilitates the epidemic 
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X axis: a 

Y axis: number of nodes 
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Figure 11. Ip and Re vs. a for the optimal routing and the efficient routing on three 
real-world networks. The results are the average over 100 independent runs. 


spreading. Increasing the network density or network homogeneity will enhance the 
epidemic spreading. Large-degree nodes have dual effects on the epidemic spreading, 
since large-degree infected nodes facilitate the epidemic spreading, while large-degree 
recovered nodes greatly inhibit the epidemic spreading. To realize an intense and wide 
epidemic spreading, the paths for the delivery of packets are generally biased towards 
including small-degree nodes. Increasing packet generation rate generally favors the 
epidemic spreading. However, when the amount of generated packets is larger than 
the delivery capacity of the network, there will exit traffic congestion, which blocks the 
epidemic spreading. Also, we find similar impacts of different static routing protocols on 
the traffic-driven SIR epidemic spreading. Our work helps understanding the interplay 
between traffic dynamics and epidemic spreading, and provides some clues for network 
immunization. 
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